
Transfer Learning for Video Recognition with Scarce Training Data for Deep Convolutional Neural Network


Abstract

Unconstrained video recognition and deep convolutional networks (DCNs) are two recently active topics in computer vision. In this work, we apply DCNs as frame-based recognizers for video recognition. Our preliminary studies, however, show that video corpora with complete ground truth are usually neither large nor diverse enough to learn a robust model. Networks trained directly on the video data set suffer from significant overfitting and have a poor recognition rate on the test set. The same lack-of-training-sample problem limits the use of deep models on a wide range of computer vision problems where obtaining training data is difficult. To overcome the problem, we perform transfer learning from images to videos, utilizing the knowledge in a weakly labeled image corpus for video recognition. The image corpus helps the network learn important visual patterns of natural images, patterns that are ignored by models trained only on the video corpus. The resulting networks therefore generalize better and achieve a higher recognition rate. We show that, by means of transfer learning from images to videos, we can learn a frame-based recognizer with only 4k videos. Because the image corpus is weakly labeled, the entire learning process requires only 4k annotated instances, far fewer than the million-scale image data sets required by previous works. The same approach may be applied to other visual recognition tasks where only scarce training data is available, and it improves the applicability of DCNs to various computer vision problems. Our experiments also reveal the correlation between meta-parameters and the performance of DCNs, given the properties of the target problem and data. These results lead to a heuristic for meta-parameter selection in future research that does not rely on time-consuming meta-parameter search.
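To make the image-to-video transfer step concrete, here is a minimal sketch in PyTorch of the general recipe the abstract describes, not the authors' exact pipeline: the ResNet-18 backbone, the 101-class label set, and the learning rates below are illustrative assumptions standing in for the paper's own DCN, image corpus, and hyper-parameters. The idea is to reuse convolutional filters learned from a large image corpus, swap in a new classifier head, and fine-tune on individual video frames.

```python
# Hedged sketch of image-to-video transfer learning; architecture and
# hyper-parameters are assumptions, not the paper's exact configuration.
import torch
import torch.nn as nn
from torchvision import models

NUM_VIDEO_CLASSES = 101  # assumption: e.g. an action-recognition label set

# 1) Start from a DCN whose convolutional layers were trained on a large,
#    weakly labeled image corpus (ImageNet weights stand in for that here).
backbone = models.resnet18(weights=models.ResNet18_Weights.IMAGENET1K_V1)

# 2) Replace the image classifier head with one sized for the video labels;
#    the transferred filters keep the visual patterns learned from images.
backbone.fc = nn.Linear(backbone.fc.in_features, NUM_VIDEO_CLASSES)

# 3) Fine-tune on individual video frames (frame-based recognition): each
#    frame inherits its video's label, so a few thousand videos already
#    yield many training frames.
optimizer = torch.optim.SGD(
    [
        # fresh head: larger learning rate
        {"params": backbone.fc.parameters(), "lr": 1e-2},
        # transferred layers: smaller learning rate to limit overfitting
        {"params": [p for n, p in backbone.named_parameters()
                    if not n.startswith("fc")], "lr": 1e-3},
    ],
    momentum=0.9,
)
criterion = nn.CrossEntropyLoss()

def train_step(frames: torch.Tensor, labels: torch.Tensor) -> float:
    """One fine-tuning step on a batch of video frames (N, 3, 224, 224)."""
    optimizer.zero_grad()
    loss = criterion(backbone(frames), labels)
    loss.backward()
    optimizer.step()
    return loss.item()
```

At test time, a video-level prediction can be obtained by averaging the frame-based recognizer's outputs over the video's frames; the smaller learning rate on the transferred layers is one common way to keep the image-derived filters from being washed out by the small video corpus.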
